首页> 外文OA文献 >AWESOME: A General Multiagent Learning Algorithm that Converges in Self-Play and Learns a Best Response Against Stationary Opponents
【2h】

AWESOME: A General Multiagent Learning Algorithm that Converges in Self-Play and Learns a Best Response Against Stationary Opponents

机译:真棒:融合的一般多智能体学习算法   自我发挥并学习对抗固定对手的最佳反应

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

A satisfactory multiagent learning algorithm should, {\em at a minimum},learn to play optimally against stationary opponents and converge to a Nashequilibrium in self-play. The algorithm that has come closest, WoLF-IGA, hasbeen proven to have these two properties in 2-player 2-action repeatedgames--assuming that the opponent's (mixed) strategy is observable. In thispaper we present AWESOME, the first algorithm that is guaranteed to have thesetwo properties in {\em all} repeated (finite) games. It requires only that theother players' actual actions (not their strategies) can be observed at eachstep. It also learns to play optimally against opponents that {\em eventuallybecome} stationary. The basic idea behind AWESOME ({\em Adapt When Everybody isStationary, Otherwise Move to Equilibrium}) is to try to adapt to the others'strategies when they appear stationary, but otherwise to retreat to aprecomputed equilibrium strategy. The techniques used to prove the propertiesof AWESOME are fundamentally different from those used for previous algorithms,and may help in analyzing other multiagent learning algorithms also.
机译:令人满意的多主体学习算法应该至少{\ em}学习如何对静止的对手进行最佳比赛,并在自我比赛中收敛到纳什均衡。最接近的算法WoLF-IGA已被证明在2人2动作重复游戏中具有这两个属性-假设可以观察到对手的(混合)策略。在本文中,我们提出了AWESOME,这是第一种保证{\ em all}重复(有限)游戏具有这两个属性的算法。它只需要在每个步骤中观察其他参与者的实际行动(而不是他们的策略)。它还学会与{\ em最终成为}静止的对手进行最佳对抗。 AWESOME({\ em当每个人都平稳时适应,否则转移到平衡})背后的基本思想是,试图在其他人的策略静止不动时适应他们的策略,而后退到预先计算的均衡策略。用于证明AWESOME的属性的技术与以前的算法根本不同,并且可能有助于分析其他多主体学习算法。

著录项

  • 作者单位
  • 年度 2003
  • 总页数
  • 原文格式 PDF
  • 正文语种 {"code":"en","name":"English","id":9}
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号